OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Wavelet-Gradient-Fusion for Video Text Binarization

Internal identifier: 000220 (Main/Exploration); previous: 000219; next: 000221

Authors: Sangheeta Roy [India]; Palaiahnakote Shivakumara [Singapore]; Partha Pratim Roy [France]; Chew Lim Tan [Singapore]

RBID : Hal:hal-01027441

Abstract

Achieving a good character recognition rate on video images is harder than on scanned documents because video images have low resolution and complex backgrounds. In this paper, we propose a new method that fuses the horizontal, vertical and diagonal information obtained from the wavelet and the gradient on text line images to enhance the text information. We apply k-means with k=2 to row-wise and column-wise pixels separately to extract possible text information. The union of the row-wise and column-wise clusters provides the text candidates. Using the Canny edge map of the input image, the method identifies disconnections based on a mutual nearest neighbor criterion on end points, and it compares each disconnected area with the text candidates to restore the missing information. Next, the method uses connected component analysis to merge some subcomponents based on a nearest neighbor criterion. The foreground (text) and background (non-text) are separated based on a new observation that the color values at the edge pixels of a component are larger than the color values of the pixels inside the component. Finally, we use Google Tesseract OCR to validate our results, and we compare them with baseline thresholding techniques to show that the proposed method is superior to existing methods in terms of recognition rate on 236 video and 258 ICDAR 2003 text lines.

Keywords: Wavelet-Gradient-Fusion, Video text lines, Video text restoration, Video character recognition
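
The candidate-extraction step described in the abstract (k-means with k=2 applied to rows and to columns separately, followed by a union) can be sketched roughly as below. This is a minimal illustration, not the authors' implementation. It assumes the fused wavelet-gradient image is already available as a 2-D array, and it assumes that the higher-valued cluster in each row or column corresponds to text. The function names `two_means_1d` and `text_candidates`, the min/max initialisation and the iteration cap are all hypothetical choices made for the example.

```python
import numpy as np


def two_means_1d(values, iters=20):
    """Split a 1-D array into two clusters (k-means, k=2).

    Returns a boolean mask marking the cluster with the higher mean.
    Assumption for this sketch: centres start at the min and max values.
    """
    values = np.asarray(values, dtype=float)
    lo, hi = values.min(), values.max()
    if lo == hi:
        # A constant row or column carries no two-cluster structure.
        return np.zeros(values.shape, dtype=bool)
    for _ in range(iters):
        # Assign each value to the nearer centre (ties go to the low cluster).
        high = np.abs(values - hi) < np.abs(values - lo)
        new_lo, new_hi = values[~high].mean(), values[high].mean()
        if new_lo == lo and new_hi == hi:
            break
        lo, hi = new_lo, new_hi
    return high


def text_candidates(enhanced):
    """Union of row-wise and column-wise high clusters of a 2-D array."""
    enhanced = np.asarray(enhanced, dtype=float)
    row_mask = np.array([two_means_1d(row) for row in enhanced])
    col_mask = np.array([two_means_1d(col) for col in enhanced.T]).T
    return row_mask | col_mask


if __name__ == "__main__":
    # Toy "enhanced" image: the bottom row is uniformly bright, so only the
    # column-wise pass flags it, while the centre pixel is flagged row-wise.
    toy = [[1, 1, 1],
           [1, 5, 1],
           [9, 9, 9]]
    print(text_candidates(toy).astype(int))
```

In the toy input the two passes flag different pixels, which is why the union is taken rather than either mask alone.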

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Wavelet-Gradient-Fusion for Video Text Binarization</title>
<author>
<name sortKey="Roy, Sangheeta" sort="Roy, Sangheeta" uniqKey="Roy S" first="Sangheeta" last="Roy">Sangheeta Roy</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-261692" status="INCOMING">
<orgName>Tata consultancy services</orgName>
<desc>
<address>
<country key="IN"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-362131" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-362131" type="direct">
<org type="institution" xml:id="struct-362131" status="INCOMING">
<orgName>Tata consultancy services</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Inde</country>
</affiliation>
</author>
<author>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-232345" status="INCOMING">
<orgName>School of Computing</orgName>
<desc>
<address>
<country key="SG"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-301111" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301111" type="direct">
<org type="institution" xml:id="struct-301111" status="VALID">
<orgName>National University of Singapore</orgName>
<orgName type="acronym">NUS</orgName>
<desc>
<address>
<addrLine>21 Lower Kent Ridge Rd, Singapour 119077</addrLine>
<country key="SG"></country>
</address>
<ref type="url">http://www.nus.edu.sg/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Singapour</country>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author>
<name sortKey="Roy, Partha Pratim" sort="Roy, Partha Pratim" uniqKey="Roy P" first="Partha Pratim" last="Roy">Partha Pratim Roy</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-204893" status="VALID">
<orgName>Laboratoire d'Informatique de l'Université de Tours</orgName>
<orgName type="acronym">LI</orgName>
<desc>
<address>
<addrLine>64, Avenue Jean Portalis, 37200 Tours</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.li.univ-tours.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-300408" type="direct"></relation>
<relation name="EA6300" active="#struct-300298" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-300408" type="direct">
<org type="institution" xml:id="struct-300408" status="VALID">
<orgName>Polytech'Tours</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="EA6300" active="#struct-300298" type="direct">
<org type="institution" xml:id="struct-300298" status="VALID">
<orgName>Université François Rabelais - Tours</orgName>
<desc>
<address>
<addrLine>60 rue du Plat d'Étain, 37020 Tours cedex 1 </addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-tours.fr</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Tours</settlement>
<region type="old region" nuts="2">Région Centre</region>
<region type="region" nuts="2">Centre-Val de Loire</region>
</placeName>
<orgName type="university">Université François-Rabelais de Tours</orgName>
<orgName type="institution" wicri:auto="newGroup">Centre Val de Loire Université</orgName>
</affiliation>
</author>
<author>
<name sortKey="Tan, Chew Lim" sort="Tan, Chew Lim" uniqKey="Tan C" first="Chew Lim" last="Tan">Chew Lim Tan</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-232345" status="INCOMING">
<orgName>School of Computing</orgName>
<desc>
<address>
<country key="SG"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-301111" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301111" type="direct">
<org type="institution" xml:id="struct-301111" status="VALID">
<orgName>National University of Singapore</orgName>
<orgName type="acronym">NUS</orgName>
<desc>
<address>
<addrLine>21 Lower Kent Ridge Rd, Singapour 119077</addrLine>
<country key="SG"></country>
</address>
<ref type="url">http://www.nus.edu.sg/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Singapour</country>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01027441</idno>
<idno type="halId">hal-01027441</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-01027441</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-01027441</idno>
<date when="2012-11-11">2012-11-11</date>
<idno type="wicri:Area/Hal/Corpus">000130</idno>
<idno type="wicri:Area/Hal/Curation">000130</idno>
<idno type="wicri:Area/Hal/Checkpoint">000059</idno>
<idno type="wicri:Area/Main/Merge">000224</idno>
<idno type="wicri:Area/Main/Curation">000220</idno>
<idno type="wicri:Area/Main/Exploration">000220</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Wavelet-Gradient-Fusion for Video Text Binarization</title>
<author>
<name sortKey="Roy, Sangheeta" sort="Roy, Sangheeta" uniqKey="Roy S" first="Sangheeta" last="Roy">Sangheeta Roy</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-261692" status="INCOMING">
<orgName>Tata consultancy services</orgName>
<desc>
<address>
<country key="IN"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-362131" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-362131" type="direct">
<org type="institution" xml:id="struct-362131" status="INCOMING">
<orgName>Tata consultancy services</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Inde</country>
</affiliation>
</author>
<author>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-232345" status="INCOMING">
<orgName>School of Computing</orgName>
<desc>
<address>
<country key="SG"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-301111" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301111" type="direct">
<org type="institution" xml:id="struct-301111" status="VALID">
<orgName>National University of Singapore</orgName>
<orgName type="acronym">NUS</orgName>
<desc>
<address>
<addrLine>21 Lower Kent Ridge Rd, Singapour 119077</addrLine>
<country key="SG"></country>
</address>
<ref type="url">http://www.nus.edu.sg/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Singapour</country>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author>
<name sortKey="Roy, Partha Pratim" sort="Roy, Partha Pratim" uniqKey="Roy P" first="Partha Pratim" last="Roy">Partha Pratim Roy</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-204893" status="VALID">
<orgName>Laboratoire d'Informatique de l'Université de Tours</orgName>
<orgName type="acronym">LI</orgName>
<desc>
<address>
<addrLine>64, Avenue Jean Portalis, 37200 Tours</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.li.univ-tours.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-300408" type="direct"></relation>
<relation name="EA6300" active="#struct-300298" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-300408" type="direct">
<org type="institution" xml:id="struct-300408" status="VALID">
<orgName>Polytech'Tours</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="EA6300" active="#struct-300298" type="direct">
<org type="institution" xml:id="struct-300298" status="VALID">
<orgName>Université François Rabelais - Tours</orgName>
<desc>
<address>
<addrLine>60 rue du Plat d'Étain, 37020 Tours cedex 1 </addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-tours.fr</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Tours</settlement>
<region type="old region" nuts="2">Région Centre</region>
<region type="region" nuts="2">Centre-Val de Loire</region>
</placeName>
<orgName type="university">Université François-Rabelais de Tours</orgName>
<orgName type="institution" wicri:auto="newGroup">Centre Val de Loire Université</orgName>
</affiliation>
</author>
<author>
<name sortKey="Tan, Chew Lim" sort="Tan, Chew Lim" uniqKey="Tan C" first="Chew Lim" last="Tan">Chew Lim Tan</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-232345" status="INCOMING">
<orgName>School of Computing</orgName>
<desc>
<address>
<country key="SG"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-301111" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301111" type="direct">
<org type="institution" xml:id="struct-301111" status="VALID">
<orgName>National University of Singapore</orgName>
<orgName type="acronym">NUS</orgName>
<desc>
<address>
<addrLine>21 Lower Kent Ridge Rd, Singapour 119077</addrLine>
<country key="SG"></country>
</address>
<ref type="url">http://www.nus.edu.sg/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Singapour</country>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Achieving a good character recognition rate on video images is harder than on scanned documents because video images have low resolution and complex backgrounds. In this paper, we propose a new method that fuses the horizontal, vertical and diagonal information obtained from the wavelet and the gradient on text line images to enhance the text information. We apply k-means with k=2 to row-wise and column-wise pixels separately to extract possible text information. The union of the row-wise and column-wise clusters provides the text candidates. Using the Canny edge map of the input image, the method identifies disconnections based on a mutual nearest neighbor criterion on end points, and it compares each disconnected area with the text candidates to restore the missing information. Next, the method uses connected component analysis to merge some subcomponents based on a nearest neighbor criterion. The foreground (text) and background (non-text) are separated based on a new observation that the color values at the edge pixels of a component are larger than the color values of the pixels inside the component. Finally, we use Google Tesseract OCR to validate our results, and we compare them with baseline thresholding techniques to show that the proposed method is superior to existing methods in terms of recognition rate on 236 video and 258 ICDAR 2003 text lines. Keywords: Wavelet-Gradient-Fusion, Video text lines, Video text restoration, Video character recognition</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Inde</li>
<li>Singapour</li>
</country>
<region>
<li>Centre-Val de Loire</li>
<li>Région Centre</li>
</region>
<settlement>
<li>Tours</li>
</settlement>
<orgName>
<li>Centre Val de Loire Université</li>
<li>Université François-Rabelais de Tours</li>
<li>Université nationale de Singapour</li>
</orgName>
</list>
<tree>
<country name="Inde">
<noRegion>
<name sortKey="Roy, Sangheeta" sort="Roy, Sangheeta" uniqKey="Roy S" first="Sangheeta" last="Roy">Sangheeta Roy</name>
</noRegion>
</country>
<country name="Singapour">
<noRegion>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
</noRegion>
<name sortKey="Tan, Chew Lim" sort="Tan, Chew Lim" uniqKey="Tan C" first="Chew Lim" last="Tan">Chew Lim Tan</name>
</country>
<country name="France">
<region name="Région Centre">
<name sortKey="Roy, Partha Pratim" sort="Roy, Partha Pratim" uniqKey="Roy P" first="Partha Pratim" last="Roy">Partha Pratim Roy</name>
</region>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000220 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000220 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:hal-01027441
   |texte=   Wavelet-Gradient-Fusion for Video Text Binarization
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024